Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 4258 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 532.4 KiB |
| Average record size in memory | 128.0 B |
Variable types
| Numeric | 15 |
|---|---|
| Boolean | 1 |
Clay is highly correlated with Sand and 2 other fields | High correlation |
Sand is highly correlated with Clay and 1 other fields | High correlation |
Silt is highly correlated with Sand and 1 other fields | High correlation |
pH(CaCl2) is highly correlated with Clay and 2 other fields | High correlation |
pH(H2O) is highly correlated with Clay and 2 other fields | High correlation |
EC is highly correlated with N | High correlation |
OC is highly correlated with N | High correlation |
CaCO3 is highly correlated with pH(CaCl2) and 1 other fields | High correlation |
N is highly correlated with Silt and 2 other fields | High correlation |
Clay is highly correlated with Sand and 2 other fields | High correlation |
Sand is highly correlated with Clay and 1 other fields | High correlation |
Silt is highly correlated with Sand | High correlation |
pH(CaCl2) is highly correlated with Clay and 2 other fields | High correlation |
pH(H2O) is highly correlated with Clay and 2 other fields | High correlation |
EC is highly correlated with N | High correlation |
OC is highly correlated with N | High correlation |
CaCO3 is highly correlated with pH(CaCl2) and 1 other fields | High correlation |
N is highly correlated with EC and 1 other fields | High correlation |
Clay is highly correlated with Sand | High correlation |
Sand is highly correlated with Clay and 1 other fields | High correlation |
Silt is highly correlated with Sand | High correlation |
pH(CaCl2) is highly correlated with pH(H2O) and 1 other fields | High correlation |
pH(H2O) is highly correlated with pH(CaCl2) and 1 other fields | High correlation |
OC is highly correlated with N | High correlation |
CaCO3 is highly correlated with pH(CaCl2) and 1 other fields | High correlation |
N is highly correlated with OC | High correlation |
df_index is highly correlated with Point_ID and 1 other fields | High correlation |
Point_ID is highly correlated with df_index and 1 other fields | High correlation |
Revisited_point is highly correlated with df_index and 1 other fields | High correlation |
Clay is highly correlated with Sand and 3 other fields | High correlation |
Sand is highly correlated with Clay and 1 other fields | High correlation |
Silt is highly correlated with Clay and 1 other fields | High correlation |
pH(CaCl2) is highly correlated with Clay and 2 other fields | High correlation |
pH(H2O) is highly correlated with Clay and 2 other fields | High correlation |
EC is highly correlated with N and 1 other fields | High correlation |
OC is highly correlated with N | High correlation |
CaCO3 is highly correlated with pH(CaCl2) and 1 other fields | High correlation |
N is highly correlated with EC and 1 other fields | High correlation |
K is highly correlated with EC | High correlation |
df_index has unique values | Unique |
Point_ID has unique values | Unique |
Coarse has 45 (1.1%) zeros | Zeros |
CaCO3 has 1364 (32.0%) zeros | Zeros |
P has 675 (15.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-06-07 03:41:56.856217 |
|---|---|
| Analysis finished | 2022-06-07 03:42:54.473579 |
| Duration | 57.62 seconds |
| Software version | pandas-profiling v3.1.0 |
df_index
| Distinct | 4258 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2222.00822 |
| Minimum | 4 |
|---|---|
| Maximum | 21025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 216.85 |
| Q1 | 1070.25 |
| median | 2137.5 |
| Q3 | 3201.75 |
| 95-th percentile | 4057.15 |
| Maximum | 21025 |
| Range | 21021 |
| Interquartile range (IQR) | 2131.5 |
Descriptive statistics
| Standard deviation | 1797.120631 |
|---|---|
| Coefficient of variation (CV) | 0.8087821707 |
| Kurtosis | 54.06963899 |
| Mean | 2222.00822 |
| Median Absolute Deviation (MAD) | 1066 |
| Skewness | 5.394471118 |
| Sum | 9461311 |
| Variance | 3229642.564 |
| Monotonicity | Strictly increasing |
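Several of the figures in the quantile and descriptive statistics tables are derived from the others. The short sketch below recomputes the range, IQR, coefficient of variation and variance from the minimum, maximum, quartiles, mean and standard deviation reported above. The variable names are illustrative only, and the variance is assumed to be the square of the reported standard deviation.

```python
import math

# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 4, 21025
q1, q3 = 1070.25, 3201.75
mean, std = 2222.00822, 1797.120631

value_range = maximum - minimum  # reported as 21021
iqr = q3 - q1                    # reported as 2131.5
cv = std / mean                  # reported as 0.8087821707
variance = std ** 2              # reported as 3229642.564

print(value_range, iqr, round(cv, 6), round(variance, 3))
```

The same relationships can be used as a quick sanity check on any of the numeric variables in this report.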
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2049 | 1 | < 0.1% |
| 3395 | 1 | < 0.1% |
| 3367 | 1 | < 0.1% |
| 1322 | 1 | < 0.1% |
| 3371 | 1 | < 0.1% |
| 1326 | 1 | < 0.1% |
| 3375 | 1 | < 0.1% |
| 1330 | 1 | < 0.1% |
| 3379 | 1 | < 0.1% |
| 1334 | 1 | < 0.1% |
| Other values (4248) | 4248 |
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 |
| Value | Count | Frequency (%) |
| 21025 | 1 | |
| 21024 | 1 | |
| 21023 | 1 | |
| 21022 | 1 | |
| 21021 | 1 | |
| 21020 | 1 | |
| 21019 | 1 | |
| 21018 | 1 | |
| 21017 | 1 | |
| 21016 | 1 |
Point_ID
| Distinct | 4258 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40562988.21 |
| Minimum | 28061794 |
|---|---|
| Maximum | 64981672 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 28061794 |
|---|---|
| 5-th percentile | 30002319.3 |
| Q1 | 32862168 |
| median | 39762407 |
| Q3 | 46862647 |
| 95-th percentile | 54601857.4 |
| Maximum | 64981672 |
| Range | 36919878 |
| Interquartile range (IQR) | 14000479 |
Descriptive statistics
| Standard deviation | 8455944.61 |
|---|---|
| Coefficient of variation (CV) | 0.2084645383 |
| Kurtosis | -0.579222749 |
| Mean | 40562988.21 |
| Median Absolute Deviation (MAD) | 6960509 |
| Skewness | 0.5222537656 |
| Sum | 1.727172038 × 10¹¹ |
| Variance | 7.150299924 × 10¹³ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 41062402 | 1 | < 0.1% |
| 31442322 | 1 | < 0.1% |
| 38082372 | 1 | < 0.1% |
| 32802170 | 1 | < 0.1% |
| 34403708 | 1 | < 0.1% |
| 31522114 | 1 | < 0.1% |
| 38643070 | 1 | < 0.1% |
| 32462208 | 1 | < 0.1% |
| 49644930 | 1 | < 0.1% |
| 41043334 | 1 | < 0.1% |
| Other values (4248) | 4248 |
| Value | Count | Frequency (%) |
| 28061794 | 1 | |
| 28102276 | 1 | |
| 28142280 | 1 | |
| 28181874 | 1 | |
| 28182282 | 1 | |
| 28201786 | 1 | |
| 28201818 | 1 | |
| 28202170 | 1 | |
| 28221888 | 1 | |
| 28261904 | 1 |
| Value | Count | Frequency (%) |
| 64981672 | 1 | |
| 64961676 | 1 | |
| 64901672 | 1 | |
| 64901668 | 1 | |
| 64881666 | 1 | |
| 64841670 | 1 | |
| 64841666 | 1 | |
| 64821668 | 1 | |
| 64801668 | 1 | |
| 64661660 | 1 |
Revisited_point
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 4233 | 99.4% |
| True | 25 | 0.6% |
Coarse
| Distinct | 87 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.63386566 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 45 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 19 |
| Q3 | 30 |
| 95-th percentile | 49 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.80410171 |
|---|---|
| Coefficient of variation (CV) | 0.6843021926 |
| Kurtosis | 1.025218181 |
| Mean | 21.63386566 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.9487540972 |
| Sum | 92117 |
| Variance | 219.1614274 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11 | 137 | 3.2% |
| 17 | 131 | 3.1% |
| 23 | 129 | 3.0% |
| 12 | 128 | 3.0% |
| 19 | 126 | 3.0% |
| 20 | 123 | 2.9% |
| 15 | 121 | 2.8% |
| 4 | 120 | 2.8% |
| 14 | 118 | 2.8% |
| 18 | 117 | 2.7% |
| Other values (77) | 3008 |
| Value | Count | Frequency (%) |
| 0 | 45 | 1.1% |
| 1 | 91 | |
| 2 | 84 | |
| 3 | 112 | |
| 4 | 120 | |
| 5 | 112 | |
| 6 | 94 | |
| 7 | 91 | |
| 8 | 97 | |
| 9 | 109 |
| Value | Count | Frequency (%) |
| 90 | 1 | < 0.1% |
| 86 | 2 | < 0.1% |
| 84 | 1 | < 0.1% |
| 83 | 3 | |
| 82 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 79 | 2 | < 0.1% |
| 78 | 1 | < 0.1% |
| 77 | 5 |
Clay
| Distinct | 62 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.20854861 |
| Minimum | 0 |
|---|---|
| Maximum | 62 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 11 |
| median | 18 |
| Q3 | 26 |
| 95-th percentile | 39 |
| Maximum | 62 |
| Range | 62 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.72489841 |
|---|---|
| Coefficient of variation (CV) | 0.5583398632 |
| Kurtosis | 0.1905720927 |
| Mean | 19.20854861 |
| Median Absolute Deviation (MAD) | 7.5 |
| Skewness | 0.7247110128 |
| Sum | 81790 |
| Variance | 115.0234458 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 9 | 182 | 4.3% |
| 11 | 168 | 3.9% |
| 10 | 167 | 3.9% |
| 12 | 154 | 3.6% |
| 8 | 153 | 3.6% |
| 14 | 151 | 3.5% |
| 21 | 150 | 3.5% |
| 13 | 150 | 3.5% |
| 6 | 150 | 3.5% |
| 18 | 150 | 3.5% |
| Other values (52) | 2683 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 12 | 0.3% |
| 2 | 24 | 0.6% |
| 3 | 50 | 1.2% |
| 4 | 78 | |
| 5 | 102 | |
| 6 | 150 | |
| 7 | 122 | |
| 8 | 153 | |
| 9 | 182 |
| Value | Count | Frequency (%) |
| 62 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 58 | 2 | < 0.1% |
| 57 | 4 | |
| 56 | 5 | |
| 55 | 3 | |
| 54 | 2 | < 0.1% |
| 53 | 3 | |
| 52 | 7 |
Sand
| Distinct | 95 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.41874119 |
| Minimum | 2 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 22 |
| median | 34 |
| Q3 | 50 |
| 95-th percentile | 74 |
| Maximum | 100 |
| Range | 98 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 19.17138548 |
|---|---|
| Coefficient of variation (CV) | 0.512347152 |
| Kurtosis | -0.3704194592 |
| Mean | 37.41874119 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.5732820738 |
| Sum | 159329 |
| Variance | 367.5420212 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 33 | 111 | 2.6% |
| 20 | 101 | 2.4% |
| 29 | 98 | 2.3% |
| 21 | 95 | 2.2% |
| 23 | 95 | 2.2% |
| 22 | 94 | 2.2% |
| 19 | 91 | 2.1% |
| 26 | 89 | 2.1% |
| 27 | 88 | 2.1% |
| 28 | 87 | 2.0% |
| Other values (85) | 3309 |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 5 | 0.1% |
| 5 | 8 | 0.2% |
| 6 | 20 | 0.5% |
| 7 | 11 | 0.3% |
| 8 | 32 | |
| 9 | 39 | |
| 10 | 43 | |
| 11 | 55 |
| Value | Count | Frequency (%) |
| 100 | 2 | < 0.1% |
| 97 | 1 | < 0.1% |
| 96 | 2 | < 0.1% |
| 94 | 1 | < 0.1% |
| 93 | 5 | |
| 92 | 4 | |
| 91 | 2 | < 0.1% |
| 90 | 3 | 0.1% |
| 88 | 5 | |
| 87 | 8 |
Silt
| Distinct | 72 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.36683889 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 35 |
| median | 45 |
| Q3 | 53 |
| 95-th percentile | 61 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 12.5456273 |
|---|---|
| Coefficient of variation (CV) | 0.289290795 |
| Kurtosis | -0.1798274759 |
| Mean | 43.36683889 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.5164259259 |
| Sum | 184656 |
| Variance | 157.3927643 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 52 | 144 | 3.4% |
| 44 | 141 | 3.3% |
| 49 | 139 | 3.3% |
| 50 | 138 | 3.2% |
| 46 | 135 | 3.2% |
| 53 | 135 | 3.2% |
| 48 | 132 | 3.1% |
| 47 | 132 | 3.1% |
| 51 | 132 | 3.1% |
| 45 | 130 | 3.1% |
| Other values (62) | 2900 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 4 | |
| 6 | 3 | 0.1% |
| 7 | 3 | 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 5 | |
| 10 | 8 |
| Value | Count | Frequency (%) |
| 72 | 1 | < 0.1% |
| 71 | 3 | 0.1% |
| 70 | 2 | < 0.1% |
| 69 | 4 | 0.1% |
| 68 | 7 | 0.2% |
| 67 | 14 | 0.3% |
| 66 | 19 | |
| 65 | 25 | |
| 64 | 32 | |
| 63 | 40 |
pH(CaCl2)
| Distinct | 54 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.822475341 |
| Minimum | 2.8 |
|---|---|
| Maximum | 8.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 2.8 |
|---|---|
| 5-th percentile | 3.7 |
| Q1 | 4.5 |
| median | 5.9 |
| Q3 | 7.2 |
| 95-th percentile | 7.6 |
| Maximum | 8.5 |
| Range | 5.7 |
| Interquartile range (IQR) | 2.7 |
Descriptive statistics
| Standard deviation | 1.368199789 |
|---|---|
| Coefficient of variation (CV) | 0.2349859311 |
| Kurtosis | -1.389999614 |
| Mean | 5.822475341 |
| Median Absolute Deviation (MAD) | 1.3 |
| Skewness | -0.1952792444 |
| Sum | 24792.1 |
| Variance | 1.871970662 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 7.4 | 234 | 5.5% |
| 7.5 | 230 | 5.4% |
| 7.3 | 201 | 4.7% |
| 7.2 | 195 | 4.6% |
| 7.6 | 170 | 4.0% |
| 7.1 | 165 | 3.9% |
| 7 | 136 | 3.2% |
| 4.4 | 124 | 2.9% |
| 4.1 | 118 | 2.8% |
| 4.2 | 118 | 2.8% |
| Other values (44) | 2567 |
| Value | Count | Frequency (%) |
| 2.8 | 2 | < 0.1% |
| 3 | 6 | 0.1% |
| 3.1 | 10 | 0.2% |
| 3.2 | 22 | 0.5% |
| 3.3 | 23 | 0.5% |
| 3.4 | 37 | |
| 3.5 | 39 | |
| 3.6 | 43 | |
| 3.7 | 68 | |
| 3.8 | 80 |
| Value | Count | Frequency (%) |
| 8.5 | 1 | < 0.1% |
| 8.1 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7.9 | 4 | 0.1% |
| 7.8 | 17 | 0.4% |
| 7.7 | 59 | 1.4% |
| 7.6 | 170 | |
| 7.5 | 230 | |
| 7.4 | 234 | |
| 7.3 | 201 |
pH(H2O)
| Distinct | 485 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.1819845 |
| Minimum | 3.47 |
|---|---|
| Maximum | 9.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 3.47 |
|---|---|
| 5-th percentile | 4.1185 |
| Q1 | 4.95 |
| median | 6.16 |
| Q3 | 7.51 |
| 95-th percentile | 8.03 |
| Maximum | 9.05 |
| Range | 5.58 |
| Interquartile range (IQR) | 2.56 |
Descriptive statistics
| Standard deviation | 1.353989184 |
|---|---|
| Coefficient of variation (CV) | 0.2190217695 |
| Kurtosis | -1.366943265 |
| Mean | 6.1819845 |
| Median Absolute Deviation (MAD) | 1.29 |
| Skewness | -0.09573515043 |
| Sum | 26322.89 |
| Variance | 1.833286712 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 7.95 | 27 | 0.6% |
| 7.85 | 27 | 0.6% |
| 7.8 | 25 | 0.6% |
| 8 | 25 | 0.6% |
| 7.6 | 25 | 0.6% |
| 7.93 | 24 | 0.6% |
| 7.88 | 23 | 0.5% |
| 4.69 | 21 | 0.5% |
| 7.89 | 20 | 0.5% |
| 7.75 | 20 | 0.5% |
| Other values (475) | 4021 |
| Value | Count | Frequency (%) |
| 3.47 | 1 | < 0.1% |
| 3.53 | 1 | < 0.1% |
| 3.54 | 1 | < 0.1% |
| 3.56 | 3 | |
| 3.59 | 1 | < 0.1% |
| 3.6 | 1 | < 0.1% |
| 3.61 | 1 | < 0.1% |
| 3.62 | 1 | < 0.1% |
| 3.65 | 2 | |
| 3.66 | 4 |
| Value | Count | Frequency (%) |
| 9.05 | 1 | |
| 8.91 | 1 | |
| 8.69 | 1 | |
| 8.56 | 1 | |
| 8.55 | 1 | |
| 8.47 | 1 | |
| 8.45 | 1 | |
| 8.44 | 1 | |
| 8.42 | 1 | |
| 8.4 | 1 |
EC
| Distinct | 1973 |
|---|---|
| Distinct (%) | 46.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.73511977 |
| Minimum | 1.73 |
|---|---|
| Maximum | 599.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 1.73 |
|---|---|
| 5-th percentile | 4.2485 |
| Q1 | 11.015 |
| median | 17.775 |
| Q3 | 32.1 |
| 95-th percentile | 85.075 |
| Maximum | 599.6 |
| Range | 597.87 |
| Interquartile range (IQR) | 21.085 |
Descriptive statistics
| Standard deviation | 32.80522114 |
|---|---|
| Coefficient of variation (CV) | 1.18280438 |
| Kurtosis | 42.65886011 |
| Mean | 27.73511977 |
| Median Absolute Deviation (MAD) | 8.86 |
| Skewness | 4.792975001 |
| Sum | 118096.14 |
| Variance | 1076.182534 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 21 | 16 | 0.4% |
| 20.2 | 16 | 0.4% |
| 20.4 | 12 | 0.3% |
| 24.8 | 11 | 0.3% |
| 24.3 | 11 | 0.3% |
| 20.8 | 11 | 0.3% |
| 26.4 | 11 | 0.3% |
| 20.9 | 10 | 0.2% |
| 23.5 | 10 | 0.2% |
| 20.6 | 10 | 0.2% |
| Other values (1963) | 4140 |
| Value | Count | Frequency (%) |
| 1.73 | 1 | |
| 2.09 | 2 | |
| 2.12 | 1 | |
| 2.13 | 1 | |
| 2.14 | 1 | |
| 2.15 | 1 | |
| 2.16 | 1 | |
| 2.19 | 1 | |
| 2.26 | 1 | |
| 2.28 | 1 |
| Value | Count | Frequency (%) |
| 599.6 | 1 | |
| 405 | 1 | |
| 374 | 1 | |
| 340 | 1 | |
| 327 | 1 | |
| 308 | 1 | |
| 277 | 1 | |
| 264 | 1 | |
| 262 | 1 | |
| 259 | 1 |
OC
| Distinct | 1198 |
|---|---|
| Distinct (%) | 28.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.20246595 |
| Minimum | 0.4 |
|---|---|
| Maximum | 517.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0.4 |
|---|---|
| 5-th percentile | 6.5 |
| Q1 | 15.2 |
| median | 29.1 |
| Q3 | 54 |
| 95-th percentile | 137.615 |
| Maximum | 517.2 |
| Range | 516.8 |
| Interquartile range (IQR) | 38.8 |
Descriptive statistics
| Standard deviation | 59.05191066 |
|---|---|
| Coefficient of variation (CV) | 1.278111665 |
| Kurtosis | 21.95284535 |
| Mean | 46.20246595 |
| Median Absolute Deviation (MAD) | 16.5 |
| Skewness | 4.085733597 |
| Sum | 196730.1 |
| Variance | 3487.128152 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 9.8 | 16 | 0.4% |
| 20.5 | 16 | 0.4% |
| 11.7 | 16 | 0.4% |
| 16 | 16 | 0.4% |
| 15.2 | 15 | 0.4% |
| 15.3 | 15 | 0.4% |
| 11.1 | 14 | 0.3% |
| 11.6 | 14 | 0.3% |
| 14.9 | 14 | 0.3% |
| 10.8 | 14 | 0.3% |
| Other values (1188) | 4108 |
| Value | Count | Frequency (%) |
| 0.4 | 1 | < 0.1% |
| 0.7 | 2 | |
| 0.8 | 1 | < 0.1% |
| 0.9 | 2 | |
| 1 | 2 | |
| 1.1 | 1 | < 0.1% |
| 1.2 | 2 | |
| 1.6 | 1 | < 0.1% |
| 1.9 | 1 | < 0.1% |
| 2.1 | 4 |
| Value | Count | Frequency (%) |
| 517.2 | 1 | |
| 513.6 | 1 | |
| 506.7 | 1 | |
| 506 | 1 | |
| 502.2 | 1 | |
| 497.3 | 1 | |
| 494.7 | 1 | |
| 494.1 | 1 | |
| 493.7 | 1 | |
| 490.5 | 1 |
CaCO3
| Distinct | 594 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.6209488 |
| Minimum | 0 |
|---|---|
| Maximum | 907 |
| Zeros | 1364 |
| Zeros (%) | 32.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 81.75 |
| 95-th percentile | 504 |
| Maximum | 907 |
| Range | 907 |
| Interquartile range (IQR) | 81.75 |
Descriptive statistics
| Standard deviation | 169.1137744 |
|---|---|
| Coefficient of variation (CV) | 1.952342669 |
| Kurtosis | 4.294532975 |
| Mean | 86.6209488 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.226698018 |
| Sum | 368832 |
| Variance | 28599.46869 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1364 | |
| 1 | 1041 | |
| 2 | 115 | 2.7% |
| 3 | 88 | 2.1% |
| 4 | 41 | 1.0% |
| 5 | 31 | 0.7% |
| 7 | 24 | 0.6% |
| 6 | 24 | 0.6% |
| 8 | 20 | 0.5% |
| 55 | 19 | 0.4% |
| Other values (584) | 1491 |
| Value | Count | Frequency (%) |
| 0 | 1364 | |
| 1 | 1041 | |
| 2 | 115 | 2.7% |
| 3 | 88 | 2.1% |
| 4 | 41 | 1.0% |
| 5 | 31 | 0.7% |
| 6 | 24 | 0.6% |
| 7 | 24 | 0.6% |
| 8 | 20 | 0.5% |
| 9 | 7 | 0.2% |
| Value | Count | Frequency (%) |
| 907 | 1 | |
| 898 | 1 | |
| 878 | 1 | |
| 846 | 1 | |
| 816 | 1 | |
| 815 | 1 | |
| 794 | 1 | |
| 789 | 2 | |
| 788 | 1 | |
| 783 | 1 |
P
| Distinct | 782 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.76383279 |
| Minimum | 0 |
|---|---|
| Maximum | 1017.6 |
| Zeros | 675 |
| Zeros (%) | 15.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6.3 |
| median | 12 |
| Q3 | 26.8 |
| 95-th percentile | 74.845 |
| Maximum | 1017.6 |
| Range | 1017.6 |
| Interquartile range (IQR) | 20.5 |
Descriptive statistics
| Standard deviation | 31.97482945 |
|---|---|
| Coefficient of variation (CV) | 1.469172722 |
| Kurtosis | 233.4925381 |
| Mean | 21.76383279 |
| Median Absolute Deviation (MAD) | 8.3 |
| Skewness | 9.54078445 |
| Sum | 92670.4 |
| Variance | 1022.389718 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 675 | 15.9% |
| 7.1 | 34 | 0.8% |
| 6.8 | 33 | 0.8% |
| 6.7 | 32 | 0.8% |
| 9.7 | 30 | 0.7% |
| 8.5 | 30 | 0.7% |
| 7.3 | 30 | 0.7% |
| 6.2 | 29 | 0.7% |
| 8.9 | 28 | 0.7% |
| 5.6 | 27 | 0.6% |
| Other values (772) | 3310 |
| Value | Count | Frequency (%) |
| 0 | 675 | |
| 2.3 | 2 | < 0.1% |
| 2.5 | 3 | 0.1% |
| 2.6 | 2 | < 0.1% |
| 2.8 | 4 | 0.1% |
| 2.9 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.1 | 7 | 0.2% |
| 3.2 | 1 | < 0.1% |
| 3.4 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 1017.6 | 1 | |
| 380.6 | 1 | |
| 315.7 | 1 | |
| 309.2 | 1 | |
| 262.4 | 1 | |
| 256.1 | 1 | |
| 247.5 | 1 | |
| 242.1 | 1 | |
| 235.6 | 1 | |
| 232.3 | 1 |
N
| Distinct | 199 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.495302959 |
| Minimum | 0.1 |
|---|---|
| Maximum | 32.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.7 |
| Q1 | 1.4 |
| median | 2.5 |
| Q3 | 4.3 |
| 95-th percentile | 9.7 |
| Maximum | 32.3 |
| Range | 32.2 |
| Interquartile range (IQR) | 2.9 |
Descriptive statistics
| Standard deviation | 3.374013435 |
|---|---|
| Coefficient of variation (CV) | 0.9652992814 |
| Kurtosis | 13.00077819 |
| Mean | 3.495302959 |
| Median Absolute Deviation (MAD) | 1.3 |
| Skewness | 3.016819269 |
| Sum | 14883 |
| Variance | 11.38396666 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.3 | 145 | 3.4% |
| 1.2 | 144 | 3.4% |
| 1.1 | 137 | 3.2% |
| 1.6 | 127 | 3.0% |
| 1.4 | 114 | 2.7% |
| 1.5 | 112 | 2.6% |
| 1 | 104 | 2.4% |
| 1.7 | 101 | 2.4% |
| 0.8 | 101 | 2.4% |
| 2.4 | 99 | 2.3% |
| Other values (189) | 3074 |
| Value | Count | Frequency (%) |
| 0.1 | 6 | 0.1% |
| 0.2 | 5 | 0.1% |
| 0.3 | 12 | 0.3% |
| 0.4 | 27 | 0.6% |
| 0.5 | 68 | |
| 0.6 | 65 | |
| 0.7 | 87 | |
| 0.8 | 101 | |
| 0.9 | 86 | |
| 1 | 104 |
| Value | Count | Frequency (%) |
| 32.3 | 1 | |
| 28.9 | 1 | |
| 28.1 | 1 | |
| 28 | 1 | |
| 27.3 | 1 | |
| 26.9 | 1 | |
| 26.3 | 1 | |
| 26.1 | 1 | |
| 25.3 | 1 | |
| 25.2 | 1 |
K
| Distinct | 2659 |
|---|---|
| Distinct (%) | 62.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202.2023955 |
| Minimum | 4.5 |
|---|---|
| Maximum | 9873.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | 4.5 |
|---|---|
| 5-th percentile | 31.1 |
| Q1 | 81 |
| median | 146.95 |
| Q3 | 253.775 |
| 95-th percentile | 530.79 |
| Maximum | 9873.7 |
| Range | 9869.2 |
| Interquartile range (IQR) | 172.775 |
Descriptive statistics
| Standard deviation | 240.5725484 |
|---|---|
| Coefficient of variation (CV) | 1.189761119 |
| Kurtosis | 627.5753774 |
| Mean | 202.2023955 |
| Median Absolute Deviation (MAD) | 78.75 |
| Skewness | 17.21083731 |
| Sum | 860977.8 |
| Variance | 57875.15103 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 121.6 | 8 | 0.2% |
| 49.8 | 7 | 0.2% |
| 55.1 | 7 | 0.2% |
| 161.4 | 7 | 0.2% |
| 78.2 | 6 | 0.1% |
| 76 | 6 | 0.1% |
| 56.4 | 6 | 0.1% |
| 84.2 | 6 | 0.1% |
| 62.9 | 6 | 0.1% |
| 94.2 | 6 | 0.1% |
| Other values (2649) | 4193 |
| Value | Count | Frequency (%) |
| 4.5 | 1 | |
| 5.2 | 1 | |
| 5.6 | 1 | |
| 6.2 | 1 | |
| 7 | 1 | |
| 7.9 | 1 | |
| 8 | 2 | |
| 8.1 | 1 | |
| 9.8 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 9873.7 | 1 | |
| 3674.1 | 1 | |
| 1938.5 | 1 | |
| 1868.3 | 1 | |
| 1834.3 | 1 | |
| 1823.3 | 1 | |
| 1773.1 | 1 | |
| 1772.2 | 1 | |
| 1502.4 | 1 | |
| 1498.4 | 1 |
Elevation
Real number (ℝ)
| Distinct | 1364 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 824.5162048 |
| Minimum | -30 |
|---|---|
| Maximum | 11890 |
| Zeros | 10 |
| Zeros (%) | 0.2% |
| Negative | 28 |
| Negative (%) | 0.7% |
| Memory size | 33.4 KiB |
Quantile statistics
| Minimum | -30 |
|---|---|
| 5-th percentile | 62 |
| Q1 | 359.5 |
| median | 1016.5 |
| Q3 | 1163.75 |
| 95-th percentile | 1413 |
| Maximum | 11890 |
| Range | 11920 |
| Interquartile range (IQR) | 804.25 |
Descriptive statistics
| Standard deviation | 514.6675164 |
|---|---|
| Coefficient of variation (CV) | 0.6242054594 |
| Kurtosis | 87.4971512 |
| Mean | 824.5162048 |
| Median Absolute Deviation (MAD) | 279.5 |
| Skewness | 4.010219735 |
| Sum | 3510790 |
| Variance | 264882.6524 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1048 | 15 | 0.4% |
| 1041 | 15 | 0.4% |
| 1050 | 15 | 0.4% |
| 1036 | 14 | 0.3% |
| 1009 | 14 | 0.3% |
| 1080 | 14 | 0.3% |
| 1032 | 13 | 0.3% |
| 1060 | 13 | 0.3% |
| 1121 | 13 | 0.3% |
| 1070 | 13 | 0.3% |
| Other values (1354) | 4119 |
| Value | Count | Frequency (%) |
| -30 | 1 | < 0.1% |
| -17 | 1 | < 0.1% |
| -16 | 1 | < 0.1% |
| -14 | 1 | < 0.1% |
| -12 | 2 | |
| -11 | 1 | < 0.1% |
| -10 | 1 | < 0.1% |
| -9 | 3 | |
| -8 | 2 | |
| -7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11890 | 1 | |
| 11191 | 1 | |
| 4020 | 1 | |
| 2396 | 1 | |
| 2110 | 1 | |
| 2024 | 1 | |
| 1939 | 1 | |
| 1856 | 1 | |
| 1836 | 1 | |
| 1833 | 1 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
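As a rough illustration of the calculation described above, the sketch below ranks two small made-up samples (ties receive the average of their positions) and then divides the covariance of the ranks by the product of their standard deviations. The helper names `average_ranks`, `pearson_r` and `spearman_rho`, and the sample values, are assumptions for illustration only. They are not taken from the dataset or from the profiling software.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next sorted value is tied with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations (1/n factors cancel)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Illustrative values only: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this toy input the ranks of `x` and `y` coincide, so ρ reaches its maximum while r stays below it. This is the sense in which ρ catches nonlinear monotonic relationships.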
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
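The definition above can be sketched in a few lines of plain Python. The function name `pearson_r` and the sample values are hypothetical and chosen only for illustration. The second call checks the invariance property by shifting both variables and rescaling them by positive factors.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel out.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Illustrative values only, not taken from the dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shift both variables and rescale them by positive factors: r should not change.
r_rescaled = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
print(r, r_rescaled)
```

Note that rescaling one variable by a negative factor flips the sign of r, so the invariance holds up to sign.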
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
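The pair-counting description above corresponds to the variant usually called τ-a. A minimal sketch follows, with a hypothetical function name `kendall_tau_a` and made-up sample values. Library implementations often use a tie-adjusted variant such as τ-b (for example, `scipy.stats.kendalltau` defaults to τ-b), so results can differ from this sketch when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total pairs; tied pairs count as neither."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y move in the same direction.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Illustrative values only, not taken from the dataset.
x = [1, 2, 3, 4, 5]
y = [3, 1, 4, 2, 5]

tau = kendall_tau_a(x, y)
print(tau)
```

This direct version compares every pair and is therefore quadratic in the number of observations, which is fine for a small illustration but slow for large columns.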
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the phik project.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | df_index | Point_ID | Revisited_point | Coarse | Clay | Sand | Silt | pH(CaCl2) | pH(H2O) | EC | OC | CaCO3 | P | N | K | Elevation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4 | 34463934 | No | 28.0 | 10.0 | 46.0 | 44.0 | 3.9 | 4.04 | 28.40 | 43.1 | 1 | 6.3 | 2.3 | 38.6 | 315 |
| 1 | 5 | 33983238 | No | 18.0 | 14.0 | 36.0 | 50.0 | 4.2 | 4.41 | 41.80 | 32.4 | 0 | 7.5 | 3.3 | 48.0 | 137 |
| 2 | 6 | 34043240 | No | 20.0 | 18.0 | 35.0 | 46.0 | 4.9 | 5.13 | 32.00 | 21.1 | 1 | 12.4 | 2.1 | 36.0 | 131 |
| 3 | 7 | 33723266 | No | 13.0 | 14.0 | 36.0 | 50.0 | 4.0 | 4.16 | 72.40 | 53.2 | 0 | 52.1 | 4.2 | 158.5 | 137 |
| 4 | 8 | 34203268 | No | 34.0 | 19.0 | 48.0 | 34.0 | 3.7 | 3.87 | 11.63 | 16.0 | 1 | 3.7 | 1.0 | 24.4 | 514 |
| 5 | 9 | 33663268 | No | 28.0 | 8.0 | 71.0 | 20.0 | 4.0 | 3.99 | 22.20 | 16.0 | 1 | 8.3 | 1.1 | 30.0 | 232 |
| 6 | 10 | 34123260 | No | 26.0 | 13.0 | 39.0 | 48.0 | 4.7 | 4.94 | 35.00 | 46.5 | 0 | 49.7 | 3.6 | 153.1 | 377 |
| 7 | 11 | 33723292 | No | 27.0 | 22.0 | 15.0 | 63.0 | 4.6 | 4.79 | 81.10 | 40.3 | 0 | 75.5 | 5.2 | 96.2 | 152 |
| 8 | 12 | 34163274 | No | 28.0 | 4.0 | 79.0 | 17.0 | 3.0 | 3.73 | 44.00 | 506.0 | 0 | 84.1 | 20.5 | 197.2 | 570 |
| 9 | 13 | 33523262 | No | 23.0 | 25.0 | 17.0 | 58.0 | 4.1 | 4.33 | 52.90 | 48.2 | 0 | 14.6 | 4.6 | 103.6 | 168 |
Last rows
| | df_index | Point_ID | Revisited_point | Coarse | Clay | Sand | Silt | pH(CaCl2) | pH(H2O) | EC | OC | CaCO3 | P | N | K | Elevation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4248 | 21016 | 64621644 | Yes | 13.0 | 40.0 | 17.0 | 43.0 | 7.5 | 8.00 | 18.75 | 11.3 | 252 | 55.8 | 1.6 | 921.1 | 42 |
| 4249 | 21017 | 64541646 | Yes | 13.0 | 29.0 | 24.0 | 47.0 | 7.3 | 7.88 | 22.10 | 24.3 | 608 | 78.3 | 2.0 | 775.9 | 216 |
| 4250 | 21018 | 63941630 | Yes | 21.0 | 16.0 | 48.0 | 36.0 | 6.7 | 7.10 | 5.61 | 3.5 | 0 | 0.0 | 0.4 | 102.8 | 1028 |
| 4251 | 21019 | 64981672 | Yes | 34.0 | 40.0 | 18.0 | 42.0 | 7.6 | 7.98 | 15.17 | 18.1 | 180 | 0.0 | 2.2 | 558.2 | 107 |
| 4252 | 21020 | 64121612 | Yes | 29.0 | 28.0 | 28.0 | 44.0 | 7.5 | 7.94 | 16.18 | 13.2 | 570 | 0.0 | 2.2 | 373.1 | 327 |
| 4253 | 21021 | 64841666 | Yes | 6.0 | 23.0 | 55.0 | 22.0 | 7.0 | 7.44 | 31.30 | 6.4 | 2 | 96.7 | 0.9 | 589.4 | 39 |
| 4254 | 21022 | 64841670 | Yes | 17.0 | 48.0 | 21.0 | 30.0 | 7.4 | 8.02 | 19.54 | 11.2 | 17 | 63.4 | 1.5 | 835.6 | 52 |
| 4255 | 21023 | 64161658 | Yes | 15.0 | 33.0 | 34.0 | 32.0 | 7.6 | 8.10 | 19.12 | 5.8 | 85 | 6.5 | 0.7 | 337.1 | 240 |
| 4256 | 21024 | 63921638 | Yes | 23.0 | 19.0 | 46.0 | 35.0 | 7.0 | 7.50 | 10.34 | 5.2 | 1 | 0.0 | 0.5 | 56.2 | 707 |
| 4257 | 21025 | 64421628 | Yes | 14.0 | 39.0 | 21.0 | 40.0 | 7.4 | 7.86 | 18.43 | 5.0 | 765 | 14.3 | 1.5 | 358.5 | 161 |